Using Pairs of Data-Points to Define Splits for Decision Trees
Abstract
Conventional binary classification trees such as CART either split the data using axis-aligned hyperplanes or perform a computationally expensive search in the continuous space of hyperplanes with unrestricted orientations. We show that the limitations of the former can be overcome without resorting to the latter. For every pair of training data-points, there is one hyperplane that is orthogonal to the line joining the data-points and bisects this line. Such hyperplanes are plausible candidates for splits. In a comparison on a suite of 12 datasets we found that this method of generating candidate splits outperformed the standard methods, particularly when the training sets were small.
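The geometric construction in the abstract can be sketched in a few lines. This is a minimal illustration only, not the authors' implementation: the function names `bisecting_hyperplane`, `side`, and `candidate_splits` are hypothetical, and the sketch enumerates all unordered pairs because the abstract does not say whether the pairs are filtered further. For points a and b, the hyperplane w·x = t with w = b − a and t = (‖b‖² − ‖a‖²)/2 is orthogonal to the line joining them and passes through its midpoint.

```python
from itertools import combinations


def bisecting_hyperplane(a, b):
    """Return (w, t) so that w.x = t is the perpendicular bisector of segment ab.

    w = b - a is the normal vector; t is chosen so the midpoint (a + b) / 2
    lies on the hyperplane.
    """
    w = [bi - ai for ai, bi in zip(a, b)]
    t = 0.5 * (sum(bi * bi for bi in b) - sum(ai * ai for ai in a))
    return w, t


def side(x, w, t):
    """True if x lies strictly on b's side, i.e. x is closer to b than to a."""
    return sum(wi * xi for wi, xi in zip(w, x)) > t


def candidate_splits(points):
    """One candidate hyperplane per unordered pair of distinct points.

    Assumption: every pair is used; the abstract does not specify any
    restriction on which pairs are considered.
    """
    return [
        bisecting_hyperplane(a, b)
        for a, b in combinations(points, 2)
        if a != b
    ]


if __name__ == "__main__":
    w, t = bisecting_hyperplane((0, 0), (2, 0))
    print(w, t)                    # normal vector and threshold
    print(side((3, 1), w, t))      # point nearer to (2, 0)
    print(side((0.5, 5), w, t))    # point nearer to (0, 0)
```

With n training points this yields at most n(n − 1)/2 candidates, a finite set that is not restricted to axis-aligned orientations. A tree builder would then score each candidate with its usual impurity criterion, a step this sketch leaves out.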
Similar resources
Outlier Detection for Support Vector Machine using Minimum Covariance Determinant Estimator
The purpose of this paper is to identify the points that affect the performance of one of the important data mining algorithms, namely the support vector machine. The final classification decision is based on a small portion of the data called support vectors. The presence of atypical observations among these points therefore causes deviation from the correct decision. Thus...
Finding Common Weights in Two-Stage Network DEA
In data envelopment analysis (DEA), multiplier and envelopment CCR models evaluate the decision-making units (DMUs) under optimal conditions. Therefore, the best prices are allocated to the inputs and outputs. Thus, if a given DMU was not efficient under optimal conditions, it would not be considered efficient by any other models. In the current study, using common weights in DEA, a number of...
An algorithm for the anchor points of the PPS of the CCR model
Anchor DMUs are a new class in the general classification of Decision Making Units (DMUs) in Data Envelopment Analysis (DEA). An anchor DMU in DEA is an extreme-efficient DMU that defines the transition from the efficient frontier to the free-disposability part of the boundary of the Production Possibility Set (PPS). In this paper, the anchor points of the PPS of the CCR model are investigated....
Semi-supervised composite kernel learning using distance metric learning techniques
The distance metric plays a key role in many machine learning and computer vision algorithms, so choosing an appropriate distance metric has a direct effect on the performance of such algorithms. Recently, distance metric learning using labeled data or other available supervisory information has become a very active research area in machine learning applications. Studies in this area have shown t...
Common fixed points of four maps using generalized weak contractivity and well-posedness
In this paper, we introduce the concept of generalized -contractivity of a pair of maps w.r.t. another pair. We establish a common fixed point result for two pairs of self-mappings, when one of these pairs is generalized -contraction w.r.t. the other and study the well-posedness of their fixed point problem. In particular, our fixed point result extends the main result of a recent paper of Qingnian ...